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Abstract. We establish properties of a new type of fractal which has partial self similarity 
at all scales. For any collection of iterated functions systems with an associated probability 
distribution and any positive integer V there is a corresponding class of U-variable fractal sets 
or measures with a natural probability distribution. These U-variable fractals can be obtained 
from the points on the attractor of a single deterministic iterated function system. Existence, 
uniqueness and approximation results are established under average contractive assumptions. 
We also obtain extensions of some basic results concerning iterated function systems. 

(N 
X> 

i 1. Introduction 

A V-variable fractal is loosely characterised by the fact that it possesses at most V distinct 
J/3 local patterns at each level of magnification, where the class of patterns depends on the level, 

£^ see Remark |5.2| Such fractals are useful for modelling purposes and for geometric applications 

which require random fractals with a controlled degree of strict self similarity at each scale, sec 
^ [Bar06| Chapter 5]. 

Standard fractal sets or measures determined by a single iterated function system [IFS] F 
acting on a metric space X such as R fc , can be generated directly by a deterministic process, or 
alternatively by a Markov chain or "chaos game", acting on X. Now let F be a family of IFSs 
acting on X together with an associated probability distribution on F . Let V be any positive 
integer. The corresponding class of V-variable fractal sets or measures from X, and its associated 
probability distribution, can be generated by a Markov chain or "chaos game" operating not on 
the state space X but on the state space C(X) V or A4(X) V of V-tuples of compact subsets of 
X or probability measures over X, respectively. See Theorems |6.4| and |6.6[ and see Section [8] 
for a simple example. The Markov chain converges exponentially, and approximations to its 
steady state attractor can readily be obtained. The projection of the attractor in any of the 
V coordinate directions gives the class of V-variable fractal sets or measures corresponding to 
F together with its natural probability distribution in each case. The full attractor contains 
• • further information about the correlation structure of subclasses of these V-variable fractals. 

. The case V = 1 corresponds to homogeneous random fractals. The limit V — > oo gives 

standard random fractals, and for this reason the Markov chain for large V provides a fast way 
of generating classes of standard random fractals together with their probability distributions. 
Ordinary fractals generated by a single IFS can be seen as special cases of the present construction 
and this provides new insight into the structure of such fractals, see Remark |9.4| For the 
connection with other classes of fractals in the literature see Remarks 15.221 and 16.91 

We summarise the main notation and results. 

Let (X, d) be a complete separable metric space. Typically this will be Euclidean space K fe 
with the standard metric. For each A in some index set A let F x be an IFS acting on (X, d), i.e. 

M 

(1.1) F x = (tf,...J x I ,w x ,...,w x d ), fi:X^X, 0<«£<1, X>*=1. 
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Figure 1 . Sets of V- variable fractals for different V and M. See Remark 19.41 

We will require both the cases where A is finite and where A is infinite. In order to simplify the 
exposition we assume that there is only a finite number M of functions in each F x and that M 
does not depend on A. Let P be a probability distribution on some rr-algebra of subsets of A. 
The given data is then denoted by 

(1.2) F = {(X,d),F\\e A,P}. 

Let V be a fixed positive integer. 

Suppose first that the family A is finite and the functions are uniformly contractive. The 
set ICv of V -variable fractal subsets of X and the set M.y of V -variable fractal measures on X 
associated to F is then given by Definition 5.1 There are Markov chains acting on the set C(X) V 



of V-tuples of compact subsets of X and on the set M C {X) V of ^-tuples of compactly supported 
unit mass measures on X, whose stationary distributions project in any of the V coordinate 
directions to probability distributions Ay on Ky and 9Jty onAii/, respectively. Moreover, these 
Markov chains are each given by a single deterministic IFS 5y or constructed from F and 
acting on C(X) V or M C (X) V respectively. The IFS's and are called superlFS's. The 
sets ICy and My, an d the probability distributions &v and 9Jlv, are called superfractals. See 
Theorem |6.4| some of these results were first obtained in |BHS05 . The distributions M.y and 
have a complicated correlation structure and differ markedly from other notions of random 
fractal in the literature. See Remarks 15.221 and 19.11 

In many situations one needs an infinite family A or needs average contractive conditions, see 
Example 6.8 In this case one works with the set A4i(X) v of ^-tuples of finite first moment 



unit mass measures on X. The corresponding superlFS Sy 11 is pointwise average contractive 



by Theorem 6.6 and one obtains the existence of a corresponding superfractal distribution 9JTi/ 



There are technical difficulties in establishing these results, see Remarks 3.5 3.6 and 6.7 

In Section [2] the properties of the the Monge-Kantorovitch and the strong Prokhorov proba- 
bility metrics are summarised. The strong Prokhorov metric is not widely known in the fractal 
literature although it is the natural metric to use with uniformly contractive conditions. We 
include the mass transportation, or equivalently the probabilistic, versions of these metrics, as 
we need them in Theorems |3.2| |6.4| |6.6| and |7.1| We work with probability metrics on spaces of 
measures and such metrics are not always separable. So in Section [2] the extensions required to 
include non separable spaces are noted. 

In Section [3j particularly Theorem |3.2| and the following Remarks, we summarise and in 
some cases extend basic results in the literature concerning IFS's, link the measure theoretic 
and probabilistic approaches and sketch the proofs. In particular, IFS's with a possibly infinite 
family of functions and pointwise average contractive conditions are considered. The law of large 
numbers for the corresponding Markov process starting from an arbitrary point, also known as 
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the convergence theorem for the "chaos game" algorithm, is extended to the case when the 
IFS acts on a non locally compact state space. This situation typically occurs when the state 
space is a function space or space of measures, and here it is required for the superlFS S"^ 1 in 
Theorem 
Theorem 



6.6 



3.2 



The strong Prokhorov metric is used in the case of uniform contractions. We hope 
will be of independent use. 
In Section [4] we summarise some of the basic properties of standard random fractals generated 
by a family of IFS's. 

In Section [5] the representation of V- variable fractals in the space f2y of tree codes and in the 
space Ay of addresses is developed, and the connection between the two spaces is discussed. 
The space fiy is of the type used for realisations of general random fractals and consists of trees 



with each node labelled in effect by an IFS, see the comment following Definition 4.1 and see 



Definition 5.1 The space Ay is of the type used to address points on a single deterministic 
fractal, and here consists of infinite sequences of V x (M + 1) matrices each of which defines a 
map from the set of ^-tuples of sets or measures to itself, see Definition |5.13| Also see Figures [2] 
and|U 

In Section [6] the existence, uniqueness and convergence results for U-variable fractals and 
superfractals are proved, some examples are given, and the connection with graph directed IFSs 
is discussed. In Section [7] we establish the rate at which the probability distributions &y and 
dJly converge to the corresponding distributions on standard random fractal sets and measures 
respectively as V — > 00. In Section [8] a simple example of a super IFS and the associated Markov 
chain is given. 

In Section [9] we make some concluding remarks including the relationship with other types 
of fractals, extensions of the results, and some motivation for the method of construction of 
^-variable fractals. 

The reader may find it easier to begin with Section [4] and refer back to Sections [2] and [3] as 
needed, particularly in the proofs of Theorems |6.4| and |6.6| An index of notation is provided at 
the end of the paper. 

This work was partially supported by the Australian Research Council and carried out at the 
Australian National University. 

2. Preliminaries 

Throughout the paper (X,d) denotes a complete separable metric space, except where men- 
tioned otherwise. 

Definition 2.1. The collection of nonempty compact subsets of X with the Hausdorff metric 
is denoted by (C(X),du)- The collection of nonempty bounded closed subsets of X with the 
Hausdorff metric is denoted by {BC{X), du). 

For A C X let A e = {x : d(x, A) < e} be the closed e-neighbourhood of A. 

Both spaces (C(X), du) and (BC(X), du) are complete and separable if (X, d) is complete and 
separable. Both spaces are complete if (X, d) is just assumed to be complete. 

Definition 2.2 (Prokhorov metric). The collection of unit mass Borel (i.e. probability) measures 
on the Borel subsets of X with the topology of weak convergence is denoted by M. (X) . Weak 
convergence of v n — ► v means J <f> dv n — > J <j)dv for all bounded continuous <f). The (standard) 
Prokhorov metric p on A4(X) is defined by 

p(H, v) := inf {e > : p(A) < v(A e ) + e for all Borel sets Ac X } . 

The Prokhorov metric p is complete and separable and induces the topology of weak conver- 



gence. Moreover, p is complete if (X,d) is just assumed to be complete. See Bil99 pp 72,73]. 
We will not use the Prokhorov metric but mention it for comparison with the strong Prokhorov 
metric in Definition 12.41 

The Dirac measure S a concentrated at a is defined by S a (E) = 1 if a 6 E and otherwise 

Sa(E) = 0. 
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If / : X — > X or / : X — > R, the Lipschitz constant for / is denoted by Lip / and is defined 
to be the least L such that d(f(x),f(y)) < Ld(x,y) for all x,y 6 X. 

Definition 2.3 (Monge-Kantorovitch metric). The collection of those /i S M.(X) with finite 
first moment, i.e. those \x such that 

(2.1) J d(a, x) dfi(x) < oo 

for some and hence any a G X, is denoted by A^i(X). The Monge-Kantorovitch metric dun on 
Mi(X) is defined in any of the three following equivalent ways: 

d M K{^,^') :=sup| J fdpL-j fd^' : Lip / < 1 j 

(2.2) = inf | y d(x, y) dj(x, y) : 7 a Borel measure on X x X, 7^(7) = /x, 772(7) = /i' 
= inf j E d(W, W) : dist W = /i, dist W" = //' 



The maps 7Ti , 7T2 : X x X — > X are the projections onto the first and second coordinates, and 
so and \j! are the marginals for 7. In the third version the infimum is taken over X-valued 
random variables W and W with distribution and fi' respectively but otherwise unspecified 
joint distribution. 

Here and elsewhere, "dist" denotes the probability distribution on the associated random 
variable. 



The metric space (Mi(X), d^iic) is complete and separable. The moment restriction (2.1l 



is automatically satisfied if (X,d) is bounded. The second equality in (2.2 1 requires proof, see 



Dud02 §11.8], but the third form of the definition is just a rewording of the second. The 



connection between duK convergence in A^i(X) and weak convergence is given by 
v n dMK > v iff v n — > v weakly and / d(x, a) dv n (x) — > / d{x,a) dv[x) 



for some and hence any a S X. See |Vil03 Section 7.2] 



Suppose (X,d) is only assumed to be complete. If measures fi in M\(X) are also required 



to satisfy the condition fi(X \spt//) = 0, then (Mi(X),<1mk) is complete, see KraOG and 



Fed69 §2.1.16]. This condition is satisfied for all finite Borel measures fi if X has a dense subset 



whose cardinality is an Ulam number, and in particular if (X, d) is separable. The requirement 



that the cardinality of X be an Ulam number is not very restrictive, sec Fcd69 §2.2.16]. 

It is often more natural to use the following strong Prokhorov metric rather than the Monge- 
Kantorovich or standard Prokhorov metrics in the case of a uniformly contractive IFS. 

Definition 2.4 {Strong Prokhorov metric). The set of compact support, or bounded support, 
unit mass Borel measures on X is denoted by A4 C (X) or A4f,(X), respectively. 

The strong Prokhorov metric dp is defined on Mb(X) in any of the following equivalent ways: 

dpfafj,') :=inf {e > : (i(A) < n'{A t ) for all Borel sets AcX } 

(2.3) =inf {ess sup 7 d(x,y) : 7 is a measure on X x X, 771(7) = 7T 2(l) = A*'} 

= inf {ess sup d(W, W) : dist W = fi, dist W' = fJ,'} , 



where the notation is as in the paragraph following (2.2 1. 
Note that 

M C (X) c M b (X) c Mi(X) c M(X). 



The first definition in (2.3) is symmetric in /1 and // by a standard argument, see Dud02 proof 



of Theorem 11.3.1]. For discussion and proof of the second equality see |Rac91 pl60 eqn(7.4.15)] 



and the other references mentioned there. The third version is a probabilistic reformulation of 
the second. 
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Proposition 2.5. (A4 c (X),dp) and (Mb(X),dp) are complete. If v,v' E A^b(A) then 

d n (sptv,sptv') < d P (i/,v'), d M K{y,v) < d P (v,v'). 

In particular, — > v in the dp metric implies vu — > v in the duK metric and sptvk — > sptv in 
the d-H metric. 

Proof. The first inequality follows directly from the definition of the Hausdorff metric and the 



second from the final characterisations in (2.2) and (2.3) 



Completeness of M.b{X) can be shown as follows and this argument carries across to Ai c {X). 



Completeness of M. C {X) is also shown in Fal86 Theorem 9.1] 



Suppose (Pk)k>~L Q {-Mb{X),dp) is dp-Cauchy. It follows that {i>k)k>i is djifj<--Cauchy and 
hence converges to some measure v in the duK sense and in particular weakly. Moreover, 
spt(^fc)fe>i converges to some bounded closed set K in the Hausdorff sense, hence sptz^ C K 
using weak convergence, and so v S M. h (X). Suppose e > and using the fact (vk)k>i is dp- 
Cauchy choose J so k, j > J implies Vk(A) < Vj^A 11 ) for all Borel A C X. By weak convergence 
and because A e is closed, limsup,^,^ iA, (^4 e ) < v(A li ) and so Vk{A) < v(A e ) if k > J. Hence 
i>k — *• v in the dp sense. □ 

If (X, d) is only assumed to be complete, but measures in Ai c (X) and M.b{X) are also required 



to satisfy the condition fj,(X \ spt fi) — as discussed following Definition 2.3 then the same 
proof shows that Proposition |2 . 5| is still valid. The main point is that one still has completeness 
of {Mi{X),d M K)- 

Remark 2.6 ( The strong and the standard Prokhorov metrics) . Convergence in the strong Prokhorov 
metric is a much stronger requirement than convergence in the standard Prokhorov metric or the 
Monge-Kantorovitch metric. A simple example is given by X — [0, 1] and v n = (1 — -)5q + ^S\. 
Then v n — > 5q weakly and in the duK and p metrics, but dp(v ni <5o) = 1 for all n. 

The strong Prokhorov metric is normally not separable. For example, if fi x = x5q + (1 — x)S\ 
for < x < 1 then dp(fj, x , fj, y ) — 1 for x ^ y. So there is no countable dense subset. 

If / : X — > X is Borel measurable then the pushforward measure f(y) is defined by f{y)(A) = 
v(f~ 1 (A)) for Borel sets A. The scaling property for Lipschitz functions /, namely 

(2.4) d P {f{n),f{v)) <Lip/d P ( M)i y), 

follows from the definition of dp. Similar properties are well known and easily established for 
the Hausdorff and Monge-Kantorovitch metrics. 

3. Iterated Function Systems 

Definition 3.1. An iterated functions system [IFS] F — (X,fe,0 <E Q,W) is a set of maps 
fg:X—fX for 9 € O, where (A, d) is a complete separable metric space and W is a probability 
measure on some tr-algebra of subsets of 0. The map (x, 9) t—> fe(x) : X x 9 — > A is measurable 
with respect to the product cr-algebra on A x 0, using the Borel cr-algebra on A. If — 
{1, . . . , M} is finite and W(m) = w m then one writes F = (A, fx, ... , /m, Wi, . . . , wm)- 

It follows / is measurable in 9 for fixed x and in x for fixed 9. Notation such as Eg is used to 
denote taking the expectation, i.e. integrating, over the variable 9 with respect to W. 

Sometimes we will need to work with an IFS on a nonseparable metric space. The properties 



which still hold in this case will be noted explicitly. See Remark 3.6 and Theorem 6.4 

The IFS F acts on subsets of A and Borel measures over A, for finite and infinite respec- 
tively, by 

M M 

F(E) - |J f m (E), F{u) = J2 w m f m {v), 

(3.1) 



F(E) = \Jf 6 (E), F(u)= fdW(9)f e (u). 

a J 
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We put aside measurability matters and interpret the integral formally as the measure which 
operates on any Borel set A C X to give JdW(6) (fg(v))(A). The latter is thought of as a 
weighted sum via W(9) of the measures fe(v)- The precise definition in the cases we need for 



infinite O is given by (3.3) 



If F(E) = E or F(y) = v then E or v respectively is said to be invariant under the IFS F. 

In the study of fractal geometry one is usually interested in the case of an IFS with a finite 
family O of maps. If X = M 2 then compact subsets of X are often identified with black and 
white images, while measures are identified with greyscale images. Images are generated by 
iterating the map F to approximate lim^oo F k (Eo) or lim/^oo F k {uo). As seen in the following 
theorem, under natural conditions the limits exist and are independent of the starting set E or 
measure Uq. 

In the study of Markov chains on an arbitrary state space X via iterations of random functions 
on X, it is usually more natural to consider the case of an infinite family 0. One is concerned 
with a random process Z*, in fact a Markov chain, with initial state x € X, and 

(3.2) Z*(i)=x, Zl{i):=f in {Z x n _ l {i)) = f in o...of h {x)Mn>l, 

where the i n £ are independently and identically distributed [iid] with probability distribution 
W and i = i\i<x ... . The induced probability measure on the set of codes i is also denoted by 
W. 

Note that the probability P(x,B) of going from x € X into B C X in one iteration is 
W{9 : f e {x) £ B}, and P(x, ■) = dist Zf. 

More generally, if the starting state is given by a random variable Xq independent of i with 
distX = v, then one defines the random variable Z^(i) = Z*°(i). The sequence (Z^(i)) n>0 
forms a Markov chain starting according to v. We define F(y) — dist Z\ and in summary we 
have 

(3.3) i/ := distZ^, F[y) := dist^f, F n {v) = dist Z v n . 

The operator F can be applied to bounded continuous functions 4> : X — » R via any of the 
following equivalent definitions: 

(3.4) (F (</>)) (x) = U{f e {x))dW{6) (or £> m #/ TO (a;))) = E e cf,(fg(x)) =E<f>(Z?). 

In the context of Markov chains, the operator F acting on functions is called the transfer operator. 
It follows from the definitions that J F(<f>) dfj, = J (pd(Ffi), which is the expected value of <j> after 
one time step starting with the initial distribution //. If one assumes F(<p) is continuous (it 
is automatically bounded) then F acting on measures is the adjoint of F acting on bounded 
continuous functions. Such F are said to satisfy the weak Feller property — this is the case if all 
fe are continuous by the dominated convergence theorem, or if the pointwise average contractive 



condition is satisfied, see SteOl 



We will need to apply the maps in (3.2) in the reverse order. Define 

(3.5) Z$(i)=x, Z*{i) = f il o...of in (x)l£n>l. 
Then from the iid property of the i n it follows 

(3.6) F n (u) = distZ^ = disti^, F n {(j))(x) = E<£(Z£) =E<f>(Z*). 

However, the pathwise behaviour of the processes Z^ and Z^ are very different. Under suitable 
conditions the former is ergodic and the latter is a.s. convergent, see Theorems |3.2|c and |3.2|a 



respectively, and the discussion in DF99 



The following Theorem |3.2| is known with perhaps two exceptions: the lack of a local com- 
pactness requirement in (c) and the use of the strong Prokhorov metric in (d) . 

is a more 



The strong Prokhorov metric, first used in the setting of random fractals in |Fal86 



natural metric than the Monge-Kantorovitch metric when dealing with uniformly contractive 
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maps and fractal measures in compact spaces. Either it or variants may be useful in image 



compression matters. In Theorem 6.4 we use it to strengthen the convergence results in BHS05 



The pointwise ergodic theorems for Markov chains for any starting point as established in 



Bre60 Elt87 BEH89 , Elt90 , MT93 require compactness or local compactness, see Remark 3.5 



We remove this restriction in Theorem 13.21 The result is needed in Theorem 16.61 where we 
consider an IFS operating on the space (Aii(X) v , cLmk) of F-tuples of probability measures. 
Like most spaces of functions or measures, this space is not locally compact even if X = M fc . See 
Remark 13.51 and also Remark 16.81 

We assume a pointwise average contractive condition, see Remark |3.3| We need this in 
Theorem |6.6[ see Remark |6.7| 

The parts of the theorem have a long history. In the Markov chain literature the contraction 
conditions (3.7 1 and (3.131 were introduced in |Isa62| and |DF37| res pectively in order to estab- 
lish ergodicity. In the fractal geometry literature, following |Man77)|Man82 , the existence and 
uniqueness of attractors, their properties, and the Markov chain approach to generating fractals, 



were introduced in |Hut81 BD85 DS86 BE88 Elt87 Elt90 . Sec Stc98 SteOl DF99 for further 



developments and more on the history. 

Theorem 3.2. Let F = (X, /§, 9 £ 0, W) be an IFS on a complete separable metric space (X, d). 
Suppose F satisfies the pointwise average contractive and average boundedness conditions 

(3.7) E0d(fe(x),fe(y))<rd(x,y) and L :— Eg d(fe(a),a) < oo 
for some fixed < r < 1, all x,y £ X , and some a £ X . 

a. For some function H, all x, y £ X and all n, we have 

(3.8) Ed(Z*(i),Z*(t)) = Ed(Z*(«),2*(t)) < r n d(x,y), Ed(Z*(t),n(«)) < l*r n , 

where 7 X = Eg d(x, fe(x)) < d(x,a) + L/{1 — r) and H(i) is independent of x. The map II is 
called the address map from code space into X. Note that H is defined only a.e. 

If r < s < 1 then for all x, y € X for a.e. i — . . . i n . . . there exists no = no(i, s) such that 

(3.9) d(Z*(i),Zy(i)) < s n d(x,y), d(Z*(i),Zl(i)) < s n d(x,y), d(^(t),n(t)) < J x s n , 
for n > no . 

b. If v is a unit mass Borel measure then F n [y) — * \x weakly where \i := II(W) is the projection 
of the measure W on code space onto X via II. Equivalently, (i is the distribution ofH regarded 
as a random variable. In particular, 

F( M ) = n 

and fi is the unique invariant unit mass measure. 

The map F is a contraction on (A4i(X), dMK) with Lipschitz constant r. Moreover, \i € 
M\{X) and 

(3.10) d MK {F n {v)^) < r n d MK (v^) 
for every v € Mi(X). 

c. For all x £ X and for a.e. i, the empirical measure (probability distribution) 

n 



(3.11) 



k—l 



weakly. Moreover, if A is the support of fj, then there exists no = n (i,x,e) such that 

(3.12) Z*{i)cA e ifn>n . 

d. Suppose F satisfies the uniform contractive and uniform boundedness conditions 

(3.13) sup 9 d(fe(x),fe(y)) < rd(x,y) and L := sup e d(fg(a),a) < oo 
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for some r < 1, all x,y G X , and some a G X . Then 

(3.14) d{Zl{i),Zl{i)) < r n d(x,y), d(Z*(i) , Z«(i)) < r n d(x,y), d{Z*{i) ,U(i)) < lx r n , 

for all x,y G X and all i. The address map II is everywhere defined and is continuous with 
respect to the product topology defined on code space and induced from the discrete metric on 0. 
Moreover, [i = n(W) G M c and for any v G Mb, 

(3.15) d P {F n {v), t i)<r n d P {v,n). 

Suppose in addition O is finite and W({0}) > for 9 G 0. Then A is compact and for any 
closed bounded E 

(3.16) d n ((F n (E),A)<r n d n (E,A). 
Moreover, F(A) = A and A is the unique closed bounded invariant set. 



Proof, a. The first inequality in (3.8) follows from (3.2 1, (3.5 1 and contractivity. 
Next fix x. Since 



(3.17) 



Ed(Z» +1 (i),Z«(i)) < rEd(Z^i),Z^i)) 



for all n, it follows that Z®{i) and Z*(i) a.s. converge exponentially fast to the same limit II(i) 
(say) by the first inequality in ( |3.8[ ). It also follows that (3.171 is simultaneously true with i 
replaced by ik+iik+2ik+3, ■ ■ ■ for every k. It then follows from (3.5 1 that 



n(i) =/h o... o fi k (U(i k+1 i k+2 i k+3 ...)). 



Again using (3.5) 



E d(Z^(i),U{i)) < r n E d{x, U(i n+1 i n+2 i n+3 ...))= r n E d(x, n(»)) < r n (d(x, a) + E d(a, n(i))) . 
But 

E d(a, n(i)) < E d(a, f n (a)) + ^ E d(/ 4l o . . • o f in (a), o ■ ■ ■ o f in+1 (a)) < ^ r n L = ^— . 



n>l 



n>0 



This gives the second inequality in (3.8 1. See Ste98 for details 



The estimates in (3.9 1 are the standard consequence that exponential convergence in mean 



implies a.s. exponential convergence. 

b. Suppose (j> G CBC(X, d), the set of bounded continuous functions on X. Let v be any unit mass 
measure. Since for a.e. i, Z*(i) — > II(i) for every x, using the continuity of <p an d dominated 
convergence, 



(j)d{F n v) 



>d(dist££) (by (3.61) 



<j){T\{i)dW{i)dv = \ <t>dn 



by the definition of fi for the last equality Thus F n (v) —> fi weakly. The invariance of /i and 
the fact fj, is the unique invariant unit measure follow from the weak Feller property. 

One can verify that F : A4i(X) — ► A4i(X) and F is a contraction map with Lipschitz constant 
r in the duK metric. It is easiest to use the second or third form of (2.2 1 for this. The rest of 
(b) now follows. 

c. The main difficulty here is that (X, d) may not be locally compact and so the space 'BC(X, d) 



need not be separable, see Remark 3.5 We adapt an idea of Varadhan, see Dud02 p399, Thm 
11.4.1]. 

There is a totally bounded and hence separable, but not usually complete, metric e on X 
such that (X, e) and (X, d) have the same topology, see |Dud02| p72, Thm 2.8.2]. Moreover, as 
the proof there shows, e(x, y) < d(x, y). Because the topology is preserved, weak convergence of 
measures on (X, d) is the same as weak convergence on (X, e). 
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Let H>£j(X, e) denote the set of bounded Lipschitz functions over (X, e). Then r BL(X, e) is 
separable in the sup norm from the total boundedness of e. 

Suppose (f> £ CBil(X, e). By the ergodic theorem, since /i is the unique invariant measure for F, 



(3.18) 



/ 



4>dnl(i) 



1 



k=l 



for a.e. i and /i a.e. y. 

Suppose a; G X and choose y 6 X such that ( |3.18[ ) is true. Using (a), for a.e. i 

e{Zl{i),Zl{i)) <«(i),2»(t))-0. 



It follows from (3.18) and the uniform continuity of <f> in the e metric that for a.e. i, 

1 



(3.19) 



E 

k = l 



Let 5 be a countable dense subset of l BL(X 1 e) in the sup norm. One can ensure that (3.191 
is simultaneously true for all <f> £ S. By an approximation argument it follows ( 3.19[ ) is simulta- 
neously true for all </> £ r BL(X, e). 

Since (A, e) is separable, weak convergence of measures v n — > is equivalent to J" 4>dv n — > 
J 4>dv for all € 23£(A, e), see Dud02 Thm 11.3.3, p395]. Completeness is not needed for this. 
It follows that — > [i weakly as required. 



The result (3.121 follows from the third inequality in (3.9 1 



d. The three inequalities in (3.14) are straightforward as are the claims concerning II. 



It follows readily from the definitions that each of Ai c (X), A4b(X), C(X) and BC(X) are 
closed under F, and that F is a contraction map with respect to dp in the first two cases and 
dn in the second two cases. The remaining results all follow easily. □ 

Remark 3.3 (Contractivity conditions) . The pointwise average contractive condition is implied by 
the global average contractive condition Eg rg := J rg dW(9) < r, where rg := Lip fg. Although 
the global condition is frequently assumed, for our purposes the weaker pointwise assumption is 



necessary, see Remark 6.7 



In some papers, for example |DF99 WWOO , parts of Theorem 3.2 are established or used 



under the global log average contractive and average boundedness conditions 



(3.20) 



E e logr e <0, Egr g q <oo, E g d q (a, fg{a)) < oo, 



for some q > and some a £ X. However, since d q is a metric for < q < 1 an d since 
(Eg g q (9)) 1 ^ q I exp (Eg log g(0)) as q J. for g > 0, such results follow from Theorem 



3.2 



In the 

main Theorem 5.2 of |DF99| the last two conditions are replaced by the equivalent algebraic tail 
condition. 

One can even obtain in this way similar consequences under the yet weaker pointwise log 



average conditions. See also Elt87 



Pointwise average contractivity is a much weaker requirement than global average contractiv- 
ity A simple example in which fg is discontinuous with positive probability is given by < e < 1, 
X = [0, 1] with the standard metric d, O = [0, 1], W{0} = W{1} = e/2, and otherwise W is 
uniformly distributed over (0, 1) according to W{(a,b)} = (1 — e)(b — a) for < a < b < 1. 
Let fg = be the characteristic function of [9, 1] for < 9 < 1 and let fx = be the zero 

function. Then Eg d(fg(x), fg(y)) < (1 — e)d(x, y) for all x and y. The unique invariant measure 
is of course ISq + \ 5\. Uniqueness fails if e = 0. A simple example where fg is discontinuous with 



probability one is fg = ^{1} and 9 is chosen uniformly on the unit interval. See Ste98 SteOl 
for further examples and discussion. 



Remark 3.4 (Alternative starting configurations). One can extend (3.8 1 by allowing the starting 
point x to be distributed according to a distribution v and considering the corresponding random 
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variables Z^(i). Then (3.10) follows directly by using the random variables Z%(i) and Z£(i) in 
the third form of dZ2|. 



Analogous remarks apply in deducing the distributional convergence results in Theorem|3.2|d 



from the pointwise convergence results, via the third form of (2.3 1 



Remark 3.5 (Local compactness issues). Versions of Theorem |3.2| c for locally compact (X, d) 
were establishe d in |Bre60[[Elt87[|BEH^|Ert90[pT93| . In that case one first proves vague 
convergence in ( 3.11| . By v n — > v vaguely one means J 4>dv n — » / 4>dv for all cf> 6 G C (X) where 
G C (X) is the set of compactly supported continuous functions tj> : X — ► K. The proof of vague 
convergence is straightforward from the ergodic theorem since G C (X) is separable. Moreover, 
in a locally compact space, vague convergence of probability measures to a probability measure 
implies weak convergence. That this is not true for more general spaces is a consequence of the 
following discussion. 

The e xten sion of Theorem |3.2| c to non locally compact spaces is needed in Section [6] and 
Theorem 6.6 In order to study images in M. k we consider IFS's whose component functions act 
on the space (A4i(M. k ) v , djvfif ) of V-tuples of unit mass measures over K fc , where V is a natural 
number. Difficulties already arise in proving that the chaos game converges a.s. from every initial 
V-tuple of sets even for V = k = 1 . 

To see this suppose v$ € A4i(M) and e > 0. Then B c (vq) := {v : duK(v,vo) < e} is not 
sequentially compact in the duK metric and so (A4i(R), dun) is not locally compact. To show 

sequential compactness does not hold let v n — ( 1 J uq H T n Vo, where T n (x) = x + n is 

\ nJ n 

translation by n units in the x-direction. Then clearly v n — > weakly. Setting f(x) = x 



in (2.21 



duK^n, vq) > j x dv n - j x duo = (l - —J xdu + - (x + n) dv$ 



x di>{) = e. 



e 

nJ j n 

On the other hand, let W be a random measure with dist W = Vq. Independently of the value 

of W let W = W with probability 1 and W — r n W with probability — . Then again 

n n 



from (2.2), 



d MK {v n ,vb) < Ed MK (W, W')=(l--)x0+-n=e. 

It follows that v n € B e (vo) and v n v in the duK metric, nor does any subsequence. Since 
dMK implies weak convergence it follows that (v n )n>\ bas no convergent subsequence in the 
duK metric. 

It follows that C c (A^i(K), dun) contains only the zero function. Hence vague convergence in 
this setting is a vacuous notion and gives no information about weak convergence. 

Finally, we note that although (c) is proved here assuming the pointwise average contractive 
condition, it is clear that weaker hypotheses concerning the stability of trajectories will suffice 
to extend known results from the locally compact setting. 

Remark 3.6 (Separability and measurability issues). If (X, d) is separable then the class of Borcl 
sets for the product topology on X x X is the product er-algebra of the class of Borel sets on X 
with itself, see Bil99 p 244]. It follows that 9 i— >■ d(fg(x), fo(y)) is measurable for each x, y 6 X 
and so the quantities in ( |3.7[ ) are well defined. 

Separability is not required for the uniform contractive and uniform boundedness conditions 
in ( 3.13[ ) and the conclusions in (d) are still valid with essentially the same proofs. The spaces 
Ai c (X), Aib(X) and Aii(X) need to be restricted to separable measures as discussed following 



Definition (2.3) and Proposition 2.5 



Separability is also used in the proof of (c). If one drops this condition and assumes the uni- 
form contractive and boundedness conditions ( |3.13[ ) then a weaker version of (c) holds. Namely, 
for every x £ X and every bounded continuous cf> € HG(X), for a.e. i 



(3.21) 



> dfx. 
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The point is that unlike the situation in (c) under the hypothesis of separability, the set of such 
i might depend on the function <j). 

In Theorem |6.4| we apply Theorem |3.2| to an IFS whose component functions operate on 
(Ai c (X) v , dp) where V is a natural number. Even in the case V = 1 this space is not separable, 
see Example 1 2. 6 1 

4. Tree Codes and Standard Random Fractals 



Let F = {X, F \ A G A, P} be a family of IFSs as in (fTTTJ and ( JP) . We assume the IFSs F 



are uniformly contractive and uniformly bounded, i.e. for some < r < 1, 

(4.1) sup A max m d(/^(x),/^(?;)) < rd(x,y) and L := sup A max TO d(f^(a), a) < oc 
for all x,y G X and some a £ X. More general conditions are assumed in Section [6] 

We often use * to indicate concatenation of sequences, either finite or infinite. 

Definition 4.1 (Tree codes). The tree T is the set of all finite sequences from {1,...,M}, 
including the empty sequence 0. If a = o\ . . . o~k G T then the length of a is \a\ = k and |0| = 0. 
A tree code a; is a map u> : T — > A. The metric space (f2, d) of all tree codes is defined by 

(4.2) n = {u\u;:T^A}, d(u,u/)=-^ 

if lo(o~) — w'(a") for all cr with |cr| < k and w(cr) ^ uj'(a) for some cr with \o~\ = k. 
A finite tree code of height k is a map w : {cr G T : |cr| < k} — > A. 

If a; E ft and r E T then the tree code wjr is defined by (wjr) (cr) := w(r * cr). It is the tree 
code obtained from ui by starting at the node r. One similarly defines oj\t if lj is a finite tree 
code of height fc and |r| < k. 

If a; E ft and fc is a natural number then the finite tree code w\_k defined by (uj[k)(a) — uj(o~) 
for \a\ < k. It is obtained by truncating w at the level k. 

The space (ft,ci) is complete and bounded. If A is finite then (ft,d) is compact. 
The tree code u> associates to each node a G T the IFS F^^. It also associates to each a =/= 



the function fm a ' where a = a' * m. The M k components of K% in (4.3 1 are then obtained 
by beginning with the set Kq and iteratively applying the functions associated to the k nodes 
along each of the M k branches of depth k obtained from ui. A similar remark applies to 

Definition 4.2 (Fractal sets and measures). If Kq G C(X) and /iq G M c (X) then the prefractal 
sets K£, the prefractal measures the fractal set K u and the fractal measure fi 03 , are given 
by 

K%= |J f^of^of^^o-.-of^--^^), 
o£T, \a\=k 

(4-3) n% = ]T <^<^ l) -----<t 1 --- ak - l) f^ $) °f^ 

creT, \a\=k 

= lim K%, \i" = lim 

k — >oo k — >oo 

It follows from uniform contractivity that for all u) one has convergence in the Hausdorff and 
strong Prokhorov metrics respectively, and that K u and fj, u are independent of Kq and [1q . 
The collections of all such fractals sets and measures for fixed {F a }a<ea are denoted by 

(4.4) /Coo = {K" : u> G ft}, Moo = {fx" : u G ft}. 
For each fc one has 

(4.5) K" = |J where ^ := o /£"0 o /#<™) o • • • o /^•••^-O 

kl=* 

The M fe sets K% are called the subfractals of if" at level k. 
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The maps lu i— * and lu > /i" are Holder continuous. More precisely: 



Proposition 4.3. With L and r as in (4.1 1, 

2L 



(4.6) 



1 - r 



d a (uj,uj') and d P (n",fj,")< 



2L 
1-r 



where a = log(l/r) / log M . 



Proof. Applying (4.3 1 with Kq replaced by {a}, and using (4.1l and repeated applications of 
the triangle inequality, it follows that d^K 1 ^ ,a) < (1 + r + r l + - • - )L = L/(l — r) and so 
d H (K u ',K u '') < 2L/(1 - r) for any lu and J. If d(uu,Lu') = M~ k then w{a) = uu'(a) for 
\a\ < k, and since dn(K u ^, K"'^ a ) < 2L/(1 - r), it follows from p~5| and contractivity that 
dniK^.K^') < Since r fc = M" fca = cF(w,u/), the result for sets follows. 



The proof for measures is essentially identical; one replaces po by S a in (4.3 1. 



□ 



2 

A 

A G A 

Aa G \ 

A G A _ A^A 

G *~ G 



2 

A 

A G A 

A G A _ A G A 

G G 



AA AAa^a A AA AA a\ 



FGF 

\i/ \! 



r G= F G (3 If Hi If 

\i/ \;/ \i/ 

G F G_ 



Figure 2. Random Sierpinski triangles and tree codes 

Example 4.4 {Random Sierpinski triangles and tree codes). The relation between a tree code o> 
and the corresponding fractal set K u can readily be seen in Figure [2] The IFSs F — (/i, fa, fs) 
and G = (31,32,53) act on R 2 , and the f m and 3 m are similitudes with contraction ratios 1/2 
and 1/3 respectively and fixed point m. If a node is labelled F, then reading from left to 
right the three main branches of the subtree associated with that node correspond to f\,f%, fa 
respectively. Similar remarks apply if the node is labelled G. 

If the three functions in F and the three functions in G each arc given weights equal to 1/3 
then the measure (i u is distributed over K u in such a way that 1 /3 of the mass is in each of the 
top level triangles, 1/9 in each of the next level triangles, etc. 

The sets K u and measures p? in Definition |4.2| are not normally self similar in any natural 
sense. However, there is an associated notion of statistical self similarity. For this we need the 
following definition. 



The reason for the notation p^ in the following definition can be seen from Theorem 7.1 

Definition 4.5 (Standard random fractals). The probability distribution on CI is defined by 
choosing Lu(a) £ A for each a G T in an iid manner according to P. 

The random set K = u> <— > and the random measure M = lu t— > p u , each defined by 
choosing lu € f2 according to p^ , are called standard random fractals. 

The induced probability distributions on K-oo and A4oc respectively are defined by M.^ = dist K 
and 9Jl x = dist M. 
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It follows from the definitions that IK and M are statistically self similar in the sense that 

(4.7) distK = distFQKi, . . . ,K M ), dist M = dist F(Mi, . . . , M M ), 

where F is a random IFS chosen from (F x )\<za according to P, Ki, . . . , Km are iid copies of K 
which are independent of F, and Mi, ... , Mm are iid copies of M which are independent of F. 

Here, and in the following sections, an IFS F acts on M-tuples of subsets K\, . . . , Km of X 
and measures fj,i, . . . , (Mm ° ver A by 



M 



M 



(4.8) 



F{K X , K M ) = (J f m (K m ), F(m,. . . , fi M ) = ^ w mfm(^m.)- 



This extends in a pointwise manner to random IFSs acting on random sets and random measures 
as in (4.7 1. 

We use the terminology "standard" to distinguish the class of random fractals given by Def- 
X5]and discussed in ( |Fal86[|MW86]|Gra87[|HR98[|HR00| ) from other classes of random 



mition 



fractals in the literature. 



5. V- Variable Tree Codes 



5.1. Overview. We continue with the assumptions that F = {X,F X ,X € A,P} is a family 
of IFSs as in (1.1) and (1.2|, and that {F x }\eA satisfies the uniform contractive and uniform 
bounded conditions (4.1 1. In Theorem 16.61 and Example 6.8 the uniformity conditions are re- 



placed by pointwise average conditions. 



In Section |5.2| Definition |5.1| we d efine the set fly C f2 of V- variable tree codes, where 

Since the {i^ A }^ g A are uniformly contractive this 



4.1 



is the set of tree codes in Definition 
leads directly to the class ICy of V-variable fractal sets and the class My of V-variable fractal 
measures. 

fly is alternatively obtained from an IFS 4>y — (£l v ,Q a ,a € Ay) acting on 



In Section 



5.3 



f2 . More precisely, the attractor fl v c f2 of <fv projects in any of the V-coordinate directions 
to fly. However, Q v ^ (Qv) V and in fact there is a high degree of dependence between the 
coordinates of any ui = (u>i, . . . , ujy) S Cly. 



If 



U) = lim $ a ° O $ ai O • • • O (UJ°, . . . , OJy) 

k — >oo 



we say a; has address agai 



■ a k ■ 



The limit is independent of (wj, . . . ,u>y). 



In Section 5.4 a formalism is developed for finding the X^-tuple of tree codes (a>i, . . . , u>y) from 
the address ao<Zi • • • <Zfc • • • , see Proposition |5 . 1 6] and Example |5.17| Conversely, given a tree code 
lo one can find all possible addresses a^a\ . . . a k ■ . ■ of V^-tuples (u>i, . 
uji = oj. 



■ ,UJy) e fly 



for which 



In Section [575] the probability distribution py on the set fly of ^-variable tree codes is defined 
and discussed. The probability P on A first leads to a natural probability Py on the index set 
Ay for the IFS *y, see Definition |5~18| This then turns *y into an IFS (fi y ,$ a ,a € Ay,Py) 



with weights whose measure attractor p* v is a probability distribution on its set attractor fl v . 
The projection of p v in any coordinate direction is the same, is supported on fly and is denoted 
by py, see Theorem |5.21| 



5.2. V- Variability. 

Definition 5.1 (V -variable tree codes and fractals). A tree code w £ is V -variable if for each 
positive integer k there are at most V distinct tree codes of the form uj\t with \t\ = k. The set 
of V- variable tree codes is denoted by fly. 

Similarly a finite tree code lu of height p is V -variable if for each k < p there are at most V 
distinct finite subtree codes uj\t with |r| = k. 
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For a uniformly contractive family {F*}aga of IFSs, if lu is V- variable then the fractal set 



K u and fractal measure in (4.3 1 are said to be V -variable. The collections of all V- variable 
sets and measures corresponding to {^ a }agA are denoted by 

(5.1) K v = {K w : u e Oy}, My = : w € fty} 



respectively, c.f. (4.4 1 



If V = 1 then a; is V-variable if and only if \a\ = \a'\ implies cj(ct) = ui(a'), i.e. if and only if 
for each fc all values of lu(<j) at level k are equal. In the case V > 1, if a; is V- variable then for 
each fc there are at most V distinct values of lu(<j) at level fc — \a\, but this is not sufficient to 
imply V-variability. 

Remark 5.2 (V -variable terminology). The motivation for the terminology "V- variable fractal" is 
as follows. Suppose all functions belong to the same group G of transformations. For example, 
if X = R™ then G might be the group of invertible similitudes, invertible affine transformations 
or invertible projective transformations. Two sets A and B are said to be equivalent modulo G 
if A = g(B) for some g € G. If K u is V-variable and k is a positive integer, then there are at 
most V distinct trees of the form Lu\a such that \a\ = k. If |er| = |o-'| = fc and Lu\a — Lu\a' , then 



from (4.5) 



(5.2) K:= 5 (^)where g = o ■ ■ ■ o/^---) o (/^••' <7 ^ ) ) c.o(/-W) 

In particular, K£ and are equivalent modulo G. 

Thus the subfractals of at level A: form at most V distinct equivalence classes modulo G. 
However, the actual equivalence classes depend upon the level. 

Similar remarks apply to V-variable fractal measures. 

Proposition 5.3. A tree code lu is V -variable iff for every positive integer k the finite tree codes 
u[k are V -variable. 

Proof. If lu is V- variable the same is true for every finite tree code of the form u\k. 

If to is not V- variable then for some k there are at least V + 1 distinct subtree codes to\ t with 
|t| = fc. But then for some p the V + 1 corresponding finite tree codes (wJt)Lp must also be 
distinct. It follows uj[(k + p) is not V- variable. □ 

Example 5.4 (V -variable Sierpinski triangles) . The first tree code in Figure[2]is an initial segment 
of a 3-variable tree code but not of a 2-variable tree code, while the second tree is an initial 
segment of a 2-variable tree code but not of a 1-variable tree code. The corresponding Sierpinski 
type triangles are, to the level of approximation shown, 3-variable and 2-variable respectively. 

Theorem 5.5. The fly are closed and nowhere dense in fl, and 

n v cn v+1 , d H (n v ,Q)<^, \Jn v c\jn 

where the bar denotes closure in the metric d 



V>1 V>1 



Proof. For the inequality suppose u> € f2 and define fc by M k < V < M k+1 . Then if u>' is 
chosen so oj'(a) = w(er) for \a\ < k and co'(a) is constant for \a\ > fc, it follows lu' g fly and 
d(Lu',Lu) < Af _ ( fc+1 ) < V -1 , hence dft(Qy,£l) < V^ 1 . The remaining assertions are clear. □ 

5.3. An IFS Acting on V- Tuples of Tree Codes. 

Definition 5.6. The metric space (£l v ,d) is the set of V-tuples from f2 with the metric 

d((u>i . ..luv), {uj'i ■ ■■u' v )) = max . d(u) v ,u)' v ), 

Kv<V 



where d on the right side is as in Definition 4.1 
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This is a complete bounded metric and is compact if A is finite since the same its true for 
V = 1. See Definition 14 . 1 1 and the comment which follows it. The induced Hausdorff metric on 
BC(il v ) is complete and bounded, and is compact if A is finite. See the comments following 
Definition EH 



The notion of V- variability extends to V-tuplcs of tree codes, V-tuples of sets and V-tuples 
of measures. 

Definition 5.7 (V-variable V-tuples). The V-tuple of tree codes u> = £ £l v is 

V -variable if for each positive integer k there are at most V distinct subtrees of the form u> v \ a 
with v G {1, . . . , V} and \o~\ = k. The set of V-variable V-tuples of tree codes is denoted by Qy. 

Let {F a }^ g a be a uniformly contractive family of IFSs. The corresponding sets Ky of In- 
variable V-tuples of fractal sets, and M. v of V-variable V-tuples of fractal measures, are 

JC*y = {(K"\ . . . ,K"v) : K . . . ,utv) G n* v }, My = {{^\. . . , : (uii, . . . ,u>v) G n* v }, 

where if"" and fjf" are as in Definition |4.2| 

Proposition 5.8. The projection of£l v in any coordinate direction equals ily, however il' v C (Oy) 1 

Proof. To see the projection map is onto consider (u>, . . . ,uS) for u> G fly. To see £l v 5 (£ly) v 
note that a V-tuple of V-variable tree codes need not itself be V-variable. □ 

Notation 5.9. Given A G A and u>i, . . . , ujm G £1 define u) — A * (u>i, . . . , ojm) G f2 by w(0) = A 
and u(m<j) = u) m {a). Thus A * (u>i, . . . , is the tree code with A at the base node and the 
tree u> m attached to the node m for m = 1, . . . , M. 

Similar notation applies if the tui, . . . , ujm are finite tree codes all of the same height. 

We define maps on V-tuples of tree codes and a corresponding IFS on il v as follows. 

Definition 5.10 (The IFS acting on the set of V-tuples of tree codes). Let V be a positive 
integer. Let Ay be the set of all pairs of maps a = (I, J) = (i a , J a ), where 

I : {1, . . . , V} -> A, J : {1, . . . , V} x {1, . . . , M} -> {1, . . . , V}. 

For a G Ay the map $ a : £l v — > il v is defined for a; = (wi, . . . , wy) by 

(5.3) $ Q (a;) = ($•(«), . . . , $",H), $S(u>) = * ■ ■ -^J'{v,M))- 

Thus ^"(uj) is the tree code with base node I a (v), and at the end of each of its M base branches 
are attached copies of ujar v ,i), ■ ■ ■ ,wj«(„,m) respectively. 

The IFS &v acting on V-tuples of tree codes and without a probability distribution at this 
stage is defined by 

(5.4) $ v :=(n v ,$ a ,aeA v ). 
Note that $ a : fly — > fl v for each a G .Ay. 

Notation 5.11. It is often convenient to write a = (i a , J a ) G Ay in the form 

~i a (l) J a (l,l) ... J a (l,M)~ 

(5.5) a= : : •.. : 

i a (V) J a (V,l) ■■■ J a {V,M)_ 

Thus Ay is then the set of all V x (1 + M) matrices with entries in the first column belonging 
to A and all other entries belonging to {!,..., V}. 



Theorem 5.12. Suppose €>y = (il ,<I> a ,a G Ay) is an IFS as in Definition 5.10 with A 
possibly infinite. Then each <!> a is a contraction map with Lipschitz constant \/ M. M oreover, 
with 4>y acting on subsets of £l v as in (3.1 1 and using the notation of Definition 2.1 we have 



*&y : BC(il v ) — > BC(il v ) and 4>y is a contractive map with Lipschitz constant 1/M. The unique 
fixed point of &y is £l v and in particular its projection in any coordinate direction equals fly. 
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Proof. It is readily checked that each <& a is a contraction map with Lipschitz constant 1/M. 

We can establish directly that $v{E) := Uae.4 & a (E) is closed if E is closed, since any 
Cauchy sequence from &y(E) eventually belongs to Q a (E) for some fixed a. It follows that 3>y 
is a contraction map on the complete space (BC(fl v ), d-j-c) with Lipschitz constant 1/M and so 
has a unique bounded closed fixed point (i.e. attractor). 

In order to show this attractor is the set fl v from Definition |5.7| note that fl v is bounded 
and closed in Q v . It is closed under <& a for any a <G Ay as noted before. Moreover, each u> 6 fly 
is of the form <& a (u/) for some a/ £ fl v and some (in fact many) <& a . To see this, consider 
the VM tree codes of the form io v \m for 1 < v < V and 1 < m < M, where each m is the 
corresponding node of T of height one. There are at most V distinct such tree codes, which we 



denote by <jj[ , . 



j' v , possibly with repetitions. Then from ( |5.3| 
(u u ...,uj v ) = $ a K,...,a^), 



provi 



ded 



I a (v) 



J a (v,m) 



uj v [m. 



So fly is invariant under <&y and hence is the unique attractor of the IFS &\ 



□ 



In th e pr evious theorem, although &y is an IFS, neither Theorem 3.2 nor the extensions in 
apply directly. If A is not finite then fl v is neither separable nor compact. Moreover, 



3.6 



Remark 

the map 3>y acts on sets by taking infinite unions and so we cannot apply Theorem (3.2 1. d to 
find a set attractor for <fry, since in general the union of an infinite number of closed sets need 
not be closed. 

As a consequence of the theorem, approximations to ^-variable ^-tuples of tree codes, and in 
particular to individual V- variable tree codes, can be built up from a y-tuple r of finite tree codes 
of height such as r = (A*, . . . , A*) for some A* S A, and a finite sequence ao, 02, • ■ • , afc € Ay, 



by computing the height k finite tree code <i> Q 



<J> afe (r). Here we use the natural analogue 



of (5.3 1 for finite tree codes. See also the diagrams in |BHS05, Figures 19,20] 



5.4. Constructing Tree Codes from Addresses and Conversely. 

Definition 5.13 (Addresses for V -variable V -tuples of tree codes). For each sequence a 



aoai . ..ak ■■ ■ with a^- G Ay and & ak as in (5.3 1, define the corresponding y-tuple o;° of tree 
codes by 

(5.6) 



uj a = «, ...,co v ):= lim $ a ° o $ ai o • • • o ...,lj v ), 



for any initial (w°, . . . : oj v ) G fi . 

The sequence a is called an address for the V- variable F-tuple of tree codes u) a . 
The set of all such addresses a is denoted by Ay . 

Note that the tree code $ a ° o $ Ql o • • • o $ Qfc . . . , u v ) is independent of (a;?, ... , <Jy) € Cl v 
up to and including level k, and hence agrees with u> a for these levels. 



The sequence in (5.6 1 converges exponentially fast since Lip<i> afc is < 1/M 
The map a u 



jj a : Ay — ► fiy is many-to-one, since the composition of different <i> a s may 
give the same map even in simple situations as the following example shows. 

Example 5.14 (Non uniqueness of addresses). The map a t— > uj a : A^ — > fly is many-to-one, 
since the composition of different <& a s may give the same map even in simple situations. For 
example, suppose 



M = 1, V = 2, F £ A, $ a = 
Since M = 1 tree codes here are infinite sequences, i.e. 1-branching tree codes. One readily 



F 1 
F 2 



checks from (5.3 1 that 

$ Q (cj, u)') = (F * uj, F * oj), $ b (w, w') = (F*uj,F*uj'), 
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and so 

$ Q o$ 6 (cj,lj') = $ a o$ a (w,w') = $ b o$ Q (cj,cj') = (F*F*w,F*F*w), 

$ b o $ b (w, J) = (F * F * u), F * F * J). 

The following definition is best understood from Example |5.17| 

Definition 5.15 ( Tree skeletons). Given an address a — a^a\ ... ... € Ay and v € {1, . . . , V} 

the corresponding tree skeleton J" : T — > {1, . . . , V"} is defined by 

J?(0)=i;, J?(mi) = J ao (« > mi), ^(mmn) = J ai (J"(m 1 ),m 2 ), 
JZ(mi...m k ) = J afc - I (J°(mi...m fc _i),mfc) ) ... , 



(5.7) 



where the maps J ak (v, m) are as in (5.5 1 



The tree skeleton depends on the maps J ak (w,m), but not on the maps I ak (w,m) and hence 
not on the set {F a }> g a of IFSs and its indexing set A. 

The F-tuple of tree codes (u" . . . , lj v ) can be recovered from the address a = aoai . . . . . . 

as follows. 

Proposition 5.16 (Tree codes from addresses). If a = a ai . . . df. . . . € Ap 1 is an address, I ak 
and J ak are as in ( |5.5[ ), and J™ is the tree skeleton corresponding to the J ak , then for each 
aeTandl<v<V, 

(5.8) <(cr) = I ak (J2(a)) where k = \a\. 

Proof. The proof is implicit in Example |5.17| A formal proof can be given by induction. □ 



F 3 (1) = G F 3 (2) = G F 3 (3)=F F 3 (4) = G J a3 (5) = G 



J-°2 (1,1) = 1 




J a °(l,2) = l 



F 2 (l) = 


F 




I a2 (2) 


= G 








— ^7 — r~ 




\ / 
\ / 










J al (l) = 


F 




I ai (2) 


= G 



J°D(l,l)=3 



J°°(1)=F I a «(2) = F F°(3) = G J°°(4)=F I a °(5) = G 



/as (4) 


= G 


1 

- =. J - 
.1 ■ 






2 (4,3) 


J a2 (4) 


= F 


K 


J 01 (4) 


= F 






\ \ 




J°°(4) 


= F 



J°=(5,l)=5 



J a2 (5) = G 



Figure 3. Constructing tree codes from an address. 



Example 5.17 (The Espalie^ technique). We use this to find tree codes from addresses and 
addresses from tree codes. 



1 espalier [verb]: to train a fruit tree or ornamental shrub to grow flat against a wall, supported on a lattice. 
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We first show how to represent an address a — a^a\ . . . ■ ■ ■ £ Ay by means of a diagram 
as in Figure [3] From this we construct the V- variable V-tuple of tree codes (u>i, . . . ,u>v) £ 
with address a. 

Conversely, given a V- variable y-tuple of tree codes {u)\, . . . , u>y) £ we show how to find 
the set of all its possible addresses. Moreover, given a single tree code ui G f2y we find all possible 
(w\, . . . ,u>v) S fly with U3\ = ui and all possible addresses in this case. 

For the example here let A = {F, G} where F and G are symbols. Suppose M = 3 and V = 5. 



Suppose a 
(5.9) 

~F 

F 
G 
F 
G 



o 



a ai 

1 
1 

3 
3 
4 



■ a k ■ 



£ Ay is an address of (a>i, . . . ,uy) € f2y, where 



"i 



F 
G 
G 
F 
F 



02 



F 
G 
F 
F 
G 



03 = 



G 
G 
F 
G 
G 



We will see that up to level 2 the tree codes 01% and u>2 are those shown in Figure [2] Although 
u>t and W2 are 3- variable and 2- variable respectively up to level 2, it will follow from Figure [3] 
that they are 5- variable up to level 3 and are not 4- variable. 

The diagram in Figure [3] is obtained from a = 00010203 ... by espaliering V copies of the 
tree T in Definition |4.1| up through an infinite lattice of V boxes at each level 0,1, 2, 3,... . One 
tree grows out of each box at level 0, and one element from A is assigned to each box at each 
level. When two or more branches pass through the same box from below they inosculate, i.e. 
their sub branches merge and are indistinguishable from that point upwards. More precisely, a 
determines the diagram in the following manner. For each level k and starting from each box 
v at that level, a branch terminates in box number J ak (v, 1) at level k + 1, a branch 

— — — terminates in box J ak (v,2) at level k + 1 and a branch terminates in box 

J ak (v, 3) at level k + 1. The element I ak (v) £ A is assigned to box v at level k. 

Conversely, any such diagram determines a unique address a — 00010203 ... . More precisely, 
consider an infinite lattice of V boxes at each level 0,1,2,3,... . Suppose at each level k there is 
either F or G in each of the V boxes, and from each box there are 3 branches , — — — 

and , each branch terminating in a box at level k + 1 . From this information one can 

read off I ak (v) and J ak (v, m) for each fc>0, 1 < v < V and 1 < m < M, and hence determine 
a. 

The diagram, and hence the address a, determines the tree skeleton J™ by assigning to each 
node of the copy of T growing out of box v at level 0, the number of the particular box in which 
that node sits. If u a = (oji, . . . ,u>v) is the T^-tuple of tree codes with address a then the tree 
code lo v is obtained by assigning to each node of the copy of T growing out of box v at level 
the element (fruit?) from A in the particular box in which that node sits. 

Conversely, suppose u> £ fly is a single V- variable tree code. Then the set of all possible 
u> £ fly C £l v of the form u> = (wi,tt>2, . . . , u>v) with u>i = u, and the set of all diagrams and 
corresponding addresses a £ Ay for such u>, is found as follows. 

Espalier a copy of T up through the infinite lattice with V boxes at each level in such a way 
that if <7o ■ • ■ °k and a' . . . a' k sit in the same box at level k then the sub tree codes u> [ctq . . . 
and w\a' ...a' k are equal. Since w is ^-variable, this is always possible. From level k onwards 
the two sub trees are fused together. 

The possible diagrams corresponding to this espaliered T are constructed as follows. For each 
a £ T the element w(er) £ A is assigned to the box containing a. By construction, this is the 
same clement for any two er's in the same box. The three branches of the diagram from this box 
up to the next level are given by the three sub branches of the espaliered T growing out of that 
box. If T does not pass through some box, then the F or G in that box, and the three branches 
of the diagram from that box to the next level up, can be assigned arbitrarily. 
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In this manner one obtains all possible diagrams for which the tree growing out of box 1 
at level is cu. Each diagram gives an address a G Ay as before, and the corresponding 
uj a = (wxj ■ ■ • , wv) G with address a satisfies loi — u>. 

In a similar manner, the set of possible diagrams and corresponding addresses can be obtained 
for any u> = (uj\, ui2, ■ ■ ■ , ^>v) G Qy C Sl v . 

5.5. The Probability Distribution on V- Variable Tree Codes. Corresponding to the 



probability distribution P on A in (1.2 1 there are natural probability distributions py on fly, 
&v on JCy and Tly on My. See Definition |5.1| for notation and for the following also note 
Definition 15.71 



Definition 5.18 (Probability distributions on addresses and tree codes). The probability dis- 



tribution Py on Ay, with notation as in (5.51, is defined by selecting a = (I, J) G Ay so that 
1(1), I(V) G A are iid with distribution P, so that J(l, 1), . . . , J(V, M) G {1, . . . , V} are iid 
with the uniform distribution {V -1 , . . . , V^ 1 }, and so the I(v) and J(w, m) are independent of 
one another. 

The probability distribution Py° on Ay, the set of addresses a = a$ai . . ., is defined by 
choosing the to be iid with distribution Py. 



The probability distribution p v on fl v is the image of Py under the map a i— > u> a in (5.6). 
The probability distribution py on fly is the projection of p v in any of the V coordinate 
directions. (By symmetry of the construction this is independent of choice of direction.) 

One obtains natural probability distributions on fractals sets and measures, and on V-tuples 
of fractal sets and measures as follows. 

Definition 5.19 (Probability distributions on V -variable fractals). Suppose (F x )\<z\ is a uni- 
formly contractive family of IFSs. 

The probability distributions & v and Ay on Ky and JCy respectively are those induced from 



p v and py by the maps (ui\, . . . ,ujy) t— > (K^ 1 , . . . ,K^ V ) and ui i— > in Definitions 5.7 and 5.1 
Similarly, the probability distributions 9Jl v and Wly on My and M.y respectively are those 
induced from py and py by the maps (ui, . . . , uiy) *—>■ (p Ul , . . . , fi uv ) and whji". 

That is, R v , M.y, and dJly are the probability distributions of the random objects 

(K Ul , . . . ,K Uv ), K^, (p Ul , . . . , /j? v ) and fi u respectively, under the probability distributions 
Py and py on (u>±, . . . ,u>y) and to. Since the projection of p v in each coordinate direction is py 
it follows that the projection of & v in each coordinate direction is &y and the projection of Wl v 
in each coordinate direction is DJly. However, there is a high degree of dependence between the 
components and in general p v ^ py v , &y ^ &y V and Wl v ^ Wly V . 



Definition 5.20 ((The IFS acting on the set ofV-tuples of tree codes)). The IFS 4>y in ( 5.4 1 
is extended to an IFS with probabilities by 

(5.10) *y := (n V ,<S> a ,aeAy,Py). 

Theorem 5.21. A unique measure attractor exists for <fry and equals p v . In particular, the 
projection of p v in any coordinate direction is py. 

Proof. For a £ Ay let R a : Ay -> Ay denote the operator a i-> a * a. Then (Ay,R a ,a G 
Ay,Py) is an IFS and the R a are contractive with Lipschitz constant 1/2 under the metric 
d(a,b) — 2~ k , where k is the least integer such that ^ bk- Thus this IFS has a unique 



attractor which from Definition 5.18 is Py° . 



it follows that <f> a 



Since each 4> a : fr — > Q v has Lipschitz constant X/M from Theorem 5.12 
has Lipschitz constant 1 /M in the strong Prokhorov (and Monge-Kantorovitch) metric as a map 
on measures. It also follows that &y has a unique attractor from Theorem |5 . 1 2| and Remark 3.6 



Finally, if II is the projection a i— » u) a : Ay — » £l v in (|5.6|, it is immediate that IIoi? a = <I> a oII 



and hence the attractor of &y is Tl(Py') = p v by Definition 5.18 



The projection of p v in any coordinate direction is py from Definition 5.18 □ 
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Remark 5.22 (Connection with other types of random fractals). The probability distribution 
py on fly is obtained by projection from the probability distribution Py on Ay, which is 
constructed in a simple iid manner. However, because of the combinatorial nature of the many- 
to-one map 



a i 



fly — > fly hi (5.6 1, inducing P, 



Pv Pv, 



4.1 



for 



the distribution py is very difficult to analyse in terms of tree codes. In particular, under the 
distribution py on Cly and hence on fi, the set of random IFSs uj i— > p w ( <T ) for cr e T has a 
complicated long range dependence structure. (See the comments following Definition 
the notation F^.) 

For each 1^-variable tree code there are at most V isomorphism classes of subtree codes at 
each level, but the isomorphism classes are level dependent. 

Moreover, each set of realisations of a V- variable random fractal, as well as its associated 
probability distribution, is the projection of the fractal attractor of a single deterministic IFS 



operating on ^-tuples of sets or measures. See Theorems 6.4 and 6.6 



For these reasons py, Ay and OTty are very different from other notions of a random fractal 
distribution in the literature. Sec also Remark kx91 



6. Convergence and Existence Results for SuperIFSs 
We continue with the assumption that F — {X, F x , A £ A, P} is a family of IFSs as in ( 1.1 1 



and (1.21 



The set K.y of V- variable fractal sets and the set M.y of V- variable fractal measures from 
Defi nition |5.1| together with their natural probability distribution Ry and DJly from Defini- 



tion 



5.19 



are obtained as the attractors of IFSs 



V vy 



v 



or 3y 1 under suitable conditions, 
see Theorems |6.4| and |6.6| The Markov chains corresponding to these IFSs provide MCMC 
algorithms, such as the "chaos game" , for generating samples of V^-variable fractal sets and In- 
variable fractal measures whose empirical distributions converge to the stationary distributions 
Ay and dRy respectively. 

Definition 6.1. The metrics dn, dp and dun are defined on C(X) V , M C (X) V and Mi(X) v 

by 

, K v ), (K[, . . .,K' V )) = maxd H (K v ,K' v ), 

V 

.,py), (p[, . . .,p' v )) = max dp (p v ,p' v ), 



(6.1) 



dp ((ill, 
duK ((pi, 



,Pv),(p[,---,Pv)) = V 1 ^2d M K(Pv,p / v ), 



where the metrics on the right are as in Section [2] 

The metrics du and duK are complete and separable, while the metric dp is complete but 
usually not separable. See Definitions |2 . 1[ |2~3] and [2~4| the comments which follow them, Propo- 
sition [275| and Remark |2.6J The metric dMK is usually not locally compact, see Remark |3.5| 



The following IFSs (6.3 1 are analogues of the tree IFS *y := (!!\$ a ,ae Ay) in (5.4 1 



Definition 6.2 (SuperlFS). For a 6 Ay as in Definition 5.10 let 

T a :C(X) V -^C(X) V , T a : M C (X) V -> M C (X) V , T a : Mi(X) v -> Mi(X) v 

be given by 
(6.2) 

T a (K x , ...,K V ) = (^(1,1), . . . , K Ja{lM) ),. . . , F r ^ (Kj a{VA) , . . 

T a (pi, . . . , Py) = (^F 1 (1) (/^ja( la ), . . . , /ija( ljA f)), - ..jF 1 (y) (Mja(y j l), . . . , fl Ja (y_ M ) ) J , 



where the action of F 1 ( v > is defined in (4.8 1 
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Let 

t v = (C(X) v ,T a ,aeA v ,Pv) 
(6.3) ^y c = {M c {X) v ,T a ,aeAy,P, 



v 



3y 1 - {M^Xf^^aeAv^Pv) , 
be the corresponding IFSs, with Py from Definition |5.18| These IFSs are called superlFSs. 



Two types of conditions will be used on families of IFSs. The first was introduced in (4.1 ). 

Definition 6.3 {Contr activity conditions for a family of IFSs). The family F = {X, F X ,X G 
A, P} of IFSs is uniformly contractive and uniformly bounded if for some < r < 1, 

(6.4) sup A max m d(/*(x),/^(y)) < rd(x,y) and sup A max m (a) , a) < oo 



for all x,y € X and some a G X. (The probability distribution P is not used in (6.4 1. The 
second condition is immediate if A is finite.) 

The family F is pointwise average contractive and average bounded if for some < r < 1 

(6.5) E A I^d(/*(x),/*(y))<rd(a;,y) and E A E m d(/*(o), a) < oo 

for all x, y G X and some sel. 



The following theorem includes and strengthens Theorems 15-24 from BHS05 . The space 
X may be noncompact and the strong Prokhorov metric dp is used rather than the Monge- 
Kantorovitch metric c?Mif • 

Theorem 6.4 ( Uniformly contractive conditions). Let F = {X, F x , A G A, P} be a finite family 
of IFSs on a complete separable metric space (X,d) satisfying (6.4). 

Then the superlFSs 3y and $ v c satisfy the uniform contractive condition Lip.7 ra < r. Since 
(C (X) v , d-j-c) is complete and separable, and (M C (X) V , dp) is complete but not necessarily sep- 
arable, the corresponding conclusions of Theorem \3.S\ and Remark \3.6\ are valid. 

In particular 

(1) 3y and 3v^ c each have unique compact set attractors and compactly supported separable 
measure attractors. The attractors are fC v and & v , and My and 9Jly ; respectively. 
Their projections in any coordinate direction are JCy , &y , My and 9Jly, respectively. 

(2) The Markov chains generated by the superlFSs converge at an exponential rate. 

(3) Suppose (K?, . . . , K v ) £ C{X) V . If a = a ai • • • G Af then for some (K u . . . , Ky) € 
JCy which is independent of (K^, . . . , K v ), 

T a " o • • • o T a »(Kl ...,K° V )^ (K lt ...,K V ) 

in (C(X) V , dn) as k — > oo. Moreover 



{T aa o • • • o F»(K°, ...K v ):a ,...,a k eAy}^JC 



v 



in (C(C(X) V , d-j-c), d-H) . Convergence is exponential in both cases. 

Analogous results apply for My, & v and 9Jl v , with dp or du as appropriate. 
(4) Suppose B° = . . . ,B V ) G C(X) V and a = a ai • • • G Ay and let B k (a) = B k = 
T ak {B k ^) ifk>\. Let B k (a) be the first component of B k (a). Then for a.e. a ana 
every B° , 

(6.6) £5£>.(.,->*v 

weakly in the space of probability distributions on (C(X),dft). 

For starting measures (fi®, . . . , fi v ) G M C (X) V , there are analogous results modified 
as in Remark 3.6 to account for the fact that (M c (X),dp) is not separable. 

There are similar results for V -tuples of sets or measures. 
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Proof. The assertion Lip T a < r follows from (6.1 1 by a straightforward argument using Defini- 
tions 2.1 and 2.4 equation (2.4 1 and the comment following it, and equations (|4.7|), (6.1l and 



(6.2). So the analogue of the uniform contractive condition (3.131 in Theorem 3.2 is satisfied 



3.6 



tf v and £y « 



while the uniform boundedness condition is immediate since F is finite. From Theorem I3.2l d 

each has a unique set attractor which is a subset of C(X) V 



and Remark 

and A4 C (X) respectively, and a measure attractor which is a probability distribution on the 
respective set attractor 

It re mains to identify the attractors with the sets and distributions in Definitions 5.1 5.7 
and 15.19 



5.18 



(6.7) 



CO i 



Let II denote one of the maps 

^p", (u 1 ,...,u Jv )^(K^,...,K^), (oj u ...,oj v ) (p"\...,p" v ), 



depending on the context. In the last two cases it follows from (5.3 I and (6.2 ) that LIo<f> a = jF a oII 



Also denote by II the extension of II to a map on sets, on V-tuples of sets, on measures or on 



V-tuples of measures, respectively. It follows from Theorem 3.2 d together with Definitions 5.1 
l5~7l and I5T81 that 



(6.8) 



K,y 
M*y 



nTO, tc v 
fi(n* v ),Mv 



n(fi v ) 



IL{p* v ), & v 
U{p* v ), Wlv 



H(py), 

fi(p v ). 



The rest of (i) follows from Theorems 5.12 and |5.21 



The remaining parts of the theorem follows from Theorem 3.2 and Remark 3.6 



□ 



Remark 6.5 ( Why use the dp metric?). For computing approximations to the set of V-variable 
fractals and its associated probability distribution which correspond to F , the main part of the 
theorem is (iv) with either sets or measures. The advantage of (M. C (X), dp) over (Ai c (X), dMx) 
is that for use in the analogue of (3.21 ) the space BC(M c (X),d P ) is much larger than BC(M C (X), dMx)- 
For example, if 4>{p) = ip(dp(p, Pi)) , where p\ S M. C {X) and ip is a continuous cut-off approxi- 
mation to the characteristic function of [0, e] C K, then (j> G BC(A4 C (X), dp) but is not continuous 
or even Borel over (A^ c (l),4/jf)- 

Theorem 6.6 (Average contractive conditions). Let F — {X, F , A e A. P} be a possibly infinite 
family of IFSs on a complete separable metric space (X,d) satisfying (6.5). 



Then the superlFS Sy^ 1 satisfies the pointwise average contractive and average boundedness 



conditions 



(6.9) 



E a d MK (F a {iJ,), < rd MK (n,n'), E a 4«(r(/i )^ )) < oo 



for all ft, ft' G A4i(X) v and some /j, G M.\(X) V . Since (Aii(X) v , dMn) is complete and 
separable, the corresponding conclusions of Theorem \3.S\ are valid. 
In particular 

(1) S^ 1 has a unique measure attractor and its projection in any coordinate direction is the 
same. The attractor and the projection are denoted by Tt v and 9Jty, and extend the 



corresponding distributions in Theorem 6.4 



(2) Fora.e. a = o Q Oi • ■ • G Af, if Qu?, . . . , pjf) G Mi{X) v then T aa o • • • o T a " (p\, . . . , p° v ) 
converges at an exponential rate. The limit random V -tuple of measures has probability 



distribution 971 



(3) If a 



v ■ 



a ai 



■■ e A$> and fi° = (pV, . . . , p° v ) G Mx{X) v let fi k (a) = ^ a "(fi k ~ 1 ) /< 



k > 1. Let p k (a) be the first component of fj, k (a). Then for a.e. a and every fi°. 



(6.10) 



1 y~yk-l 



Tlx. 



1 n °P"(a) 
k *■ — 'n=0 • ' 

weakly in the space of probability distributions on {M.i{X),dMK)- 



FRACTALS WITH PARTIAL SELF SIMILARITY 



23 



Proof. To establish average boundedness in (6.9 1 let /x° = (8b, ■■■,5b) € Aii(X) v for some 
tel. Then 

E a d M K(F a {Sb, ■ ■ ■ , $b), (5b, ■ ■ - ,5b)) 

= E a yY. dMK {Y. w ™ (v)S f!: M (by S b) from S @, (JTT} and ((481 



(fc)'" b . 

- Eq mi w C ( - v)d MK(5 f ^(v) {b y5 b ) by basic properties of d 

u m 

= Ea t7 X! X! w ln {v)d (fm {v) ( b ) > & ) basic properties of d MK 



= E x^J2Yl 6 ) since d ^tl a (v) = P = dist A by Definition pU8 



v m 



= E x E m d(f,^(b),b) < oo. 



To establish average contractivity in ( 6.9 > let (fii, . . . (u>i, ■ ■ ■ ,1^'v) e M.\(X) V . Then 

E a d MK (T a (^, nv),F a (»'i,- ■ ■ , lAr)) 



from (6.21, (1.1 1 and (4.8 1 



^ E <4 E E w ™ ( "Wf* (/r ( "W(«,m)), /™ by properties of 

= Ea E E W " 1 Eq dMK(fm(f J -J a (v,m)), fm(t*>'j°(v,m))) 



by the independence of 7 a (w) and J a (v, n) in Definition 5.18 and since dist I a (v) = P = dist A 

= E ^ E E w ™ E * d M^(/™(M t ), fM)) 

v m 

by the uniform distribution of J a (v 7 m) for fixed (v,m) where t is distributed uniformly over 

{i,...,v} 

= E t E\ E m d MK {ti(lH),fM))' 

Next let W t ,W[ be random variables on X such that dist Wt — fit, distW/ = fi' t and 
Ed(Wt, W/) = duK^t, Mt)i where E without a subscript here and later refers to expecta- 
tions from the sample space over which the Wt and W[ are jointly defined. This is possible 
by |Dud02| Theorem 11.8.2]. Then 

E t E\ E m dAfA'(/m(ft)i /ra(/4)) 

< Et E A E m E d(/* (W t ), /* (W?)) by the third version of I 



< r Ef E d(W t , W^ from ( 6.5 ) 



r E t duK^t, Mt) by choice of Wt and 
rd M K((m, ■ ■ .,yv), (v'i, ■ ■ ■ jMv))- 



This completes the proof of (|6.9|) . The remaining conclusions now follow by Theorem 3.2 □ 



Remark 6.7 (Global average contractivity) . One might expect that the global average contractive 
condition Ea E to Lip < 1 on the family F would imply the global average contractive condition 
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E Q LipJF a < 1, i.e. would imply that the superlFS is global average contractive. However, 
this is not the case. 

For example, let X = M, V = 2 and M = 2. Let F contain a single IFS F = (/i,/ 2 ; 1/2, 1/2) 
where 

^ c ^ 3 1 + e 1 - e 

Aw = -2^ -' 2 ^ = ~Y~ ~Y~ x ' 

Then Lip/i = 3/2, Lip/2 = (1 — e)/2 and E m Lip/ m = 1 — e/4. So F is global average 
contractive. 

Note /i(0) = and /2(1) = 1. Let fi — (5 ai S ), ^ — (Si, Si) and note d,MK(l*, A*') = 1- Then 
for any a £ Ay as in (5.5 1, 

1 



T a (n) 



L 1. 1. 1. 
2*"i + 2* 1 ' 2*"i + 2 dl 



From the first form of (|2J) with /(x) = |x| and using f6T| , d M K(F a (n), F a (rf)) > 1 - e/4 and 
so Lip JF a > 1 — e/4 for every a. 

Tf 1 2l 

Next let a* = „ _ and choose fi = (So, So), fi' = (Si,Sq), so fi and fi' differ in the box 
t 1 2 

on which /1 always acts and agree in the box on which fa always acts. Note dMK(^, /-*') = 1/2- 
Then 



1. 1, l r l r 

-0_3+-01 + e, -0^3 + + « 

2 2 2 2 2 2 2 s 



Again using the first form of ([2T2J) with fax) = |x|, it follows that d MK (F a * (fJ.),F a " (/*')) > 3/4, 
so Lip.F a * > 3/2. 

Since there are 16 possible maps a S .Ay, each selected with probability 1/16, it follows that 



E a Lip^ a > i|( I 



13,., 2 

Te'^ 1 lf e< i5- 



So for such < e < 2/15 the IFS S'y 1 is not global average contractive. But since E m Lip f m = 



3.2 



1 — e/4 it follows from Theorem 6.6 that 3y 1 is pointwise average contractive, and so Theorem 
can be applied. 

Example 6.8 (Random curves in the plane). The following shows why it is natural to consider 
families of IFSs which are both infinite and not uniformly contractive. Such examples can be 
modified to model Brownian motion and other stochastic processes, see |Gra91[ Section 5.2] and 



HROOb pp 120-122]. 



Let F = {R 2 ,F X ,X e R 2 ,N(0,a 2 I)} where F x = {/ X A , #; 1/2, 1/2} and N(0,a 2 I) is the 
symmetric normal distribution in M 2 with variance a 2 . The functions f x and f£ are uniquely 
specified by the requirements that they be similitudes with positive determinant and 

#(-1,0) = (-1,0), / A (1,0) = A, / A (-1,0)=A, #(1,0) = (1,0). 

A calculation shows er = 1.42 implies E> E m Lip sa 0.9969 and so average contractivity holds 
if a < 1.42. If |A| is sufficiently large then neither f x nor # are contractive. 

The IFS F x can also be interpreted as a map from the space C([0, 1],R 2 ), of continuous paths 
from [0, 1] to M 2 , into itself as follows: 



(F\0))(t) 



fH<t>(ty) 0<t<§, 
J 2 A (0(2t- 1)) §<*<!. 



Then one can define a superlFS acting on such functions in a manner analogous to that for the 
superlFS acting on sets or measures. Under the average contractive condition one obtains L 1 
convergence to a class of U-variable fractal paths, and in particular U-variable fractal curves, 
from (—1,0) to (1,0). We omit the details. 
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Remark 6.9 (Graph directed fractals). Fractals generated by a graph directed system [GDS] or 
more generally by a graph directed Markov system [GDMS], or by random versions of these, 
have been considered by many authors. See 01s94 MU03] for the definitions and references. We 



comment here on the connection between these fractals and ^-variable fractals. 

In particular, for each experimental run of its generation process, a random GDS or GDMS 
generates a single realisation of the associated random fractal. On the other hand, for each run, 
a superlFS generates a family of realisations whose empirical distributions converge a.s. to the 
probability distribution given by the associated Invariable random fractal. 

To make a more careful comparison, allow the number of functions M in each IFS F x as in 
(1.1) to depend on A. A GDS together with its associated contraction maps can be interpreted 



as a map from ^-tuples of sets to ^-tuples of sets. The map can be coded up by a matrix a as 



in (5.5 1, where M there is now the number of edges in the GDS. 



If V is the number of vertices in a GDS, the V-tuple (K 1 , . . . , K v ) of fractal sets generated 
by the GDS is a very particular V- variable U-tuplc. If the address a = ao&i . . . ak ■ ■ ■ for 
(K 1 , . . . , K v ) is as in ( |5.6[ ), then = a for all k. Unlike the situation for V- variable fractals as 
discussed in Remark |5.2[ there are at most V distinct subtrees which can be obtained from the 
tree codes uj v for K v regardless of the level of the initial node of the subtree. 



More generally, if (K 1 , . . . , K v ) is generated by a GDMS then for each k, 



is deter- 



mined just by au and by the incidence matrix for the GDMS. Each subtree lu v \a is completely 
determined by the value ui v (a) £ A at its base node a and by the "branch" in a = a\ . . . a/.. 

Realisations of random fractals generated by a random GDS are almost surely not ^-variable, 
and are more akin to standard random fractals as in Definition 14.51 One comes closer to In- 
variable fractal sets by introducing a notion of a homogeneous random GDS fractal set analogous 
to that of a homogeneous random fractal as in Remark |9.4| But then one does not obtain a class 
of V- variable fractals together with its associated probability distribution unless one makes the 
same definitions as in Section [5] This would be quite unnatural in the setting of GDS fractals, 
for example it would require one edge from any vertex to any vertex. 

7. Approximation Results as V — ► oo. 



Theorems 7.1 and 7.3 enable one to obtain empirical samples of standard random fractals 



up to any prescribed degree of approximation by using sufficiently large V in Theorem 6.4 iv) 



This is useful since even single realisations of random fractals are computationally expensive to 
generate by standard methods. Note that although the matrices used to compute samples of 
V- variable fractals are typically of order V x V, they are sparse with bandwidth M. 



The next theorem improves the exponent in BHS05 . Theorem 12] and removes the dependence 
on M. The difference comes from using the third rather than the first version of (2.2 1 in the 
proof. 

Theorem 7.1. If cLmk is the Monge-Kantorovitch metric then d^K {PViPaa) < 1.4 V -1 ' 3 . 
Proof. We construct random tree codes Wy and Woo with dist Wy — py and dist Woo — Poo . In 



order to apply the last equality in (2.2 1 we want the expected distance between Wy and W a 



determined by their joint distribution, to be as small as possible. 

Suppose A = A0A1A2 ■ ■ ■ is a random address with dist A = Py°. Let Wy = ^(<j) be 
the corresponding random tree code, using the notation of (5.8) and (5.7 1. It follows from 
Definition |5.18| that dist Wy = py . 

Let the random integer K = K(A) be the greatest integer such that, for < j < K, if 



= \a'\ — j and a ^ a' then Ji(a) ^ J\{&') in (5.7|. Thus with v = 1 as in Example 



5.17 



the nodes of T are placed in distinct buffers up to and including level K. 
Let Woo be any random tree code such that 

if \a\ < K then W^a) = W v (a), 

if \a\ > K then dist Woo (cr) = P and Woo(c) is independent of Woo(ct') for all a' ^ a. 
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It follows from the definition of K that Woc(a) are iid with distribution P for all a and so 
dist Woo = Poo- 

For any k and for V > M k , 

Ed(WV,Woo) 

= E(d(W v , Woo) | K > k) ■ Vxob{K > k) + E(d(W v , W^) \ K < k) ■ Vmb{K < k) 
— Prob(if < k) by Q and since W v {%) = W a 

M k -1 

n 




since II" =1 (1 — a^) > 1 — Yl7=i ai ^ or a » — ^ 

< — J— r + — *— (M 2 + M 4 + • • • + M 2k ) 

1 Af2(fc+i) x 2M 2fc - 1 

- M fe +! + 2MF(M 2 - 1) - M fe +! + 31^ ' 




assuming M > 2 for the last inequality. The estimate is trivially true if M = 1 or V < M . 

Choose x so M x — (^|-) 1 3 , this being the value of a; which minimises M l + i + — ■ Choose 
ksok<x<k + l. Hence from (2.2 1 

d M K(pv,Poc) < Ed{W v ,W 00 ) < 

□ < V- 1 '* V + ^ ( t ) 1 < 1.371/- 1 / 3 . 

□ 

Remark 7.2 (#0 analogous estimate for dp is possible in Theorem \7.1\ . The support of /9y 
converges to the support of in the Hausdorff metric by Theorem |5.5| However, pv Poo in 
the dp metric as V — > 00. 

To see this suppose M > 2, fix j, k € {1, . . . , M} with j ^ k and let i? = {lu e f2 : cj(j) = 
w(fc)}, where j and k are interpreted as sequences of length one in T. According to the probability 
distribution p^, uj(j) and oj{k) are independent if j ^ k. For the probability distribution py 
there is a positive probability l/V that J(l,j) = J(l, k), in which case u>i(j) = wi(k) must be 
equal from Proposition 5.16| while if J(l,j) ^ J(l,fc) then uji(j) and uii(k) are independent. 
Identifying the probability distribution pv on f2 with the projection of p v on Q y in the first 
coordinate direction it follows Poo(E) < pv(E). 

However, d(oj',E) > 1/M if u>' ^ E since in this case for lj E E either o-i'(j) ^ uj(j) or 
Lu'(k) ^ uj(k). Hence for e < 1/M, E c = E and so Poo(E e ) = p^E) < p v {E). It follows that 
dp(pv,Poo) > 1/M for all V if M > 2. 



Theorem 7.3. Under the assumptions of Theorem 6.4 



2L 

dniJCv,^), dniMviMoo) < y^V^, 

(7.1) r 

2 8 £ - 

), rfMK(9^y,9^oo) < 1 V Q , 

1 — r 
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whe 



L — supmaxd(/ m (a),a), 

A m 



log(l/r) 
logM 



a/3 ifa<l, 5=l/3i/a> 1. 



Proof. The first two estimates follow from Theorem 5.5 and Proposition 4.3 



For the third, let Wy and Woo be the random codes from the proof of Theorem 7.1 In 
particular, 

(7.2) p v = dist W Vl Poo = dist Woo, Ed(W Vl Wx>) < IAV~ 1/3 . 

Let II be the projection map i— > K u given by (4.3) and ( |6.8| . Then &v = distil o Wy, 
.Soo = distil o Woo, and from the last condition in (2.2 1 



(7.3) 



duK^v^oo) <Erf«(no wv,noWoo). 



If a < 1 then from (4.6) on taking expectations of both sides, using Holder's inequality and 



applying (7.2), 



2 T 2 8 J 

Ed w (no wv,no Woo) < 3 — Ed Q (w y ,Woo) < - — v- a/3 . 



i 



i 



If a > 1 then from the last two lines in the proof of Proposition (4.3), 

2L 



1 



■d a {u,J), 



and so, using this and arguing as before, 



Ed H (Uo Wy,no Woo) < 



2.8L 

1 - r 



This gives the third estimate. The fourth estimate is proved in an analogous manner. 



□ 

Sharper estimates can be obtained arguing directly as in the proof of Theorem |7.1| In par- 

log(l/r) 

ticular, the exponent a can be replaced by ; — ttttti x ■ 

log(M"yr) 

8. Example of 2- Variable Fractals 

Consider the family F = {R 2 , U,D, |, |} consisting of two IFSs U = (/i,/2) (Up with a 
reflection) and D = (31,32) (Down) acting on M 2 , where 

'x %_JL^ x _ 3y 9 \ )-(^_3y,l x_3y 

.2 8 16' 2 8 16/' l2 ^ x,y) \2 8 16' 2 8 16/ 

'x 3y 1 x 3y 7 \ , . fx 3y 9 x 3w 1 

- + — , 1 — - H J, ff2 = I H , - + — 

,2 8 16 2 8 16/ y v ' y; V2 8 16 2 8 16/ 

The corresponding fractal attractors of U and F are shown at the beginning of Figure 4. The 

probability of choice of U and D is g in each case. 

The 2-variable supeiTFS acting on pairs of compact sets is 5^ = (C(M 2 ) 2 , T a : a € Ai^P-i)- 

There are 64 maps a € ^2, each a 2 x 3 matrix. The probability distribution P2 assigns 

probability ^ to each a ^ A 2 . 

\U 2 1" 



fi(x,y) 
9i(x,y) 



In iteration 4 in Figure 4 the matrix is a = ^ ^ 2 ■ Applying •^ ra to the pair of sets 
(Ei,E 2 ) from iteration 3 gives 

^ a (E 1 ,E 2 ) = (U(E 2 ,E 1 ),U(E 1 ,E 2 )) = (/i(E 2 )U/ 2 (£i),/i(£i)U/ 2 (E 2 )). 

The process in Figure 4 begins with a pair of line segments. The first 6 iterations and iterations 
23-26 are shown. After about 12 iterations the sets are independent of the initial sets up to 
screen resolution. After this the pairs of sets can be considered as examples of 2-variable 2-tuplcs 
of fractal sets corresponding to F. 

The generation process gives an MCMC algorithm or "chaos game" and acts on the infinite 
state space (C(M 2 ) 2 , du) of pairs of compact sets with the du metric. The empirical distribution 
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% O ^ 

U (above) and D (below) at tractors 



Initial Sets 



1: D(l,2) 



17(2, 2) 



2: £7(2,2) 



f/(l,l) 



3: 



U(l,2) 



4: [7(2, 1) 



[7(1,2) 



5: D(l,2) 



[7(1, 2) 



^y v&y 



23: £>(1,2) 



D(2, 2) 



6: D(l,2) 



24: £7(2,1) 



[7(2,2) 




0(2,1) 




— - 



2 jp* ^-C^ 



25: 0(1,2) 



L7(l,2) 



26: £>(1,2) 



0(2,2) 



Figure 4. Sampling 2-variable fractals. 
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along a.e. trajectory from any starting pair of sets converges weakly to the 2- variable superfractal 
probability distribution on 2-variablc 2-tuples of fractal sets corresponding to F. The empirical 
distribution of first (and second) components converges weakly to the corresponding natural 
probability distribution on 2-variable fractal sets corresponding to F. 

9. Concluding Comments 



Remark 9.1 (Extensions). The restriction that each IFS F x in (1.1) has the same number M of 
functions is for notational simplicity only. 

In Definition |5.18| the independence conditions on the / and J may be relaxed. In some 
modelling situations it would be natural to have a degree of local dependance between (I(v), J(v)) 
and (I(w),J(w)) for v "near" w. 

The probability distribution py is in some sense the most natural probability distribution on 
the set of ^-variable code trees since it is inherited from the probability distribution Py° with 
the simplest possible probabilistic structure. We may construct more general distributions on 
the set of V- variable code trees by letting Py° be non-Bernoulli, e.g. stationary. 



Instead of beginning with a family of IFSs in (1.2 1 one could begin with a family of graph 
directed IFSs and obtain in this manner the corresponding class of ^-variable graph directed 
fractals. 

Remark 9.2 (Dimensions). Suppose F = {R n ,F x ;X e A.P} is a family of IFSs satisfying the 
strong uniform open set condition and whose maps are similitudes. In a forthcoming paper we 
compute the a.s. dimension of the associated family of ^-variable random fractals. The idea is 
to associate to each a £ Ay a certain V x V matrix and then use the Furstenberg Kesten theory 
for products of random matrices to compute a type of pressure function. 

Remark 9.3 (Motivation for the construction of V -variable fractals) . The original motivation was 
to find a chaos game type algorithm for generating collections of fractal sets whose empirical 
distributions approximated the probability distribution of standard random fractals. 



More precisely, suppose F = {(X, d),F x ,\ e A,P} is a family of IFSs as in ( |1.2| . Let V 
be a large positive integer and S be a collection of V compact subsets of X, such that the 
empirical distribution of S approximates the distribution .Soo of the standard random fractal 
associated to F by Definition |4.5[ Suppose S* is a second collection of V compact subsets of 
X obtained from F and S as follows. For each v € {1, ...,V} and independently of other 
w € {1, . . . , V}, select E\, . . . Em from S according to the uniform distribution independently 
with replacement, and independently of this select F x from F according to P. Let the wth set 
in S* be F x (E ll . . .Em) — Ui< m <M fmi^m)- Then one expects the empirical distribution of 
S* to also approximate &oo. 

The random operator constructed in this manner for passing from S to S* is essentially the 
random operator T a in Definition |6.2| with a <E Ay chosen according to Py. 



Remark 9.4 (A hierarchy of fractals). See Figure [I] 

If M = 1 in ( |1.1[ ) then each F x is a trivial IFS (/ A ) containing just one map, and the family 



F in (1.2 1 can be interpreted as a standard IFS. If moreover V = 1 then the corresponding 
super IFS in Definition |6. 2| can be interpreted as a standard IFS operating on (X, d) with set and 
measure attractors K and fi, essentially by identifying singleton subsets of X with elements in X. 
For M = 1 and V > 1 the superlFS can be identified with an IFS operating on X v with set and 
measure attractors K v and n v . Conversely, any standard IFS can be extended to a superlFS 
in this manner. The projection of K v in any coordinate direction is K, but K v ^ K . The 
attractors K v and \iy are called correlated fractals. The measure (i* provides information on a 
certain "correlation" between subsets of K. This provides a new tool for studying the structure 
of standard IFS fractals as we show in a forthcoming paper. 

The case V — 1 corresponds to homogeneous random fractals and has been studied in 



Ham92 Kif95 SteOlb . The case V — + oo corresponds to standard random fractals as defined in 
Definition 4.5 see also Section [7] See also Asa98 for some graphical examples. 
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For a given class F of IFSs and positive integer V > 1, one obtains a new class of fractals 
each with the prescribed degree V of self similarity at every scale. The associated superlFS 
provides a rapid way of generating a sample from this class of V- variable fractals whose empirical 
distribution approximates the natural probability distribution on the class. 

Large V provides a method for generating a class of correctly distributed approximations to 
standard random fractals. Small V provides a class of fractals with useful modelling properties. 
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